Development of a Robust Data Mining Method Using CBFS and RSM
Abstract
Data mining (DM) has emerged as a key feature of many information system applications. While data analysis (DA) represents a significant advance in the analytical tools currently available, its capabilities are limited. To address one of these limitations, the difficulty of identifying causal relationships, we propose an integrated approach called robust data mining (RDM), which can reduce the dimensionality of a large data set and may provide both detailed statistical relationships among the factors and robust factor settings. The objective of this paper is twofold. First, we show how DM techniques can be applied effectively to the design of a wastewater treatment process by using a correlation-based feature selection (CBFS) method. This method may be considerably more effective than alternative methods when a process design involves a large number of input factors. Second, we show how the DM results can be integrated into a robust design (RD) paradigm built on the selected significant factors. Our numerical example shows that, by reducing dimensionality, the proposed RDM method can efficiently find the significant factors and their optimal settings.
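The abstract does not spell out the CBFS procedure, so the following is only a minimal sketch of one common form of correlation-based feature selection: a greedy forward search that maximizes the merit score k·r_cf / sqrt(k + k(k−1)·r_ff), where r_cf is the mean feature-response correlation and r_ff the mean feature-feature correlation. The use of absolute Pearson correlation, the function names cfs_merit and cfs_forward_select, and the synthetic data are all assumptions made for illustration; they are not taken from the paper.

```python
import numpy as np


def cfs_merit(X, y, subset):
    """Correlation-based merit of the feature indices in `subset`.

    Assumption: absolute Pearson correlation is used as the association measure.
    """
    k = len(subset)
    # Mean absolute correlation between each selected feature and the response.
    r_cf = np.mean([abs(np.corrcoef(X[:, j], y)[0, 1]) for j in subset])
    if k == 1:
        return float(r_cf)
    # Mean absolute correlation over all pairs of selected features.
    pairs = [(a, b) for i, a in enumerate(subset) for b in subset[i + 1:]]
    r_ff = np.mean([abs(np.corrcoef(X[:, a], X[:, b])[0, 1]) for a, b in pairs])
    return float(k * r_cf / np.sqrt(k + k * (k - 1) * r_ff))


def cfs_forward_select(X, y):
    """Greedy forward search: keep adding the feature that most improves the merit."""
    selected, best = [], 0.0
    remaining = list(range(X.shape[1]))
    while remaining:
        scores = [(cfs_merit(X, y, selected + [j]), j) for j in remaining]
        score, j = max(scores)
        if score <= best:  # no candidate improves the merit, so stop
            break
        selected.append(j)
        remaining.remove(j)
        best = score
    return selected, best


# Hypothetical data: six input factors, of which only factors 0 and 3 drive the response.
rng = np.random.default_rng(0)
X = rng.normal(size=(200, 6))
y = 2.0 * X[:, 0] - 1.5 * X[:, 3] + rng.normal(scale=0.1, size=200)

selected, merit = cfs_forward_select(X, y)
print("selected factors:", sorted(selected), "merit:", round(merit, 3))
```

In a workflow like the one the abstract describes, the factors retained by such a search would then be passed to the robust design stage; that stage is not sketched here.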
Similar resources
The Role of Feature Selection with Applications to Eye Movements using Electrooculography
Eyes are the windows to the brain and the eye movements are a rich source of information in information processing. The aim of this paper is to select the features with CBFS Feature selection algorithm using eye movements by ElectroOculoGraph (EOG) signals during reading and writing task. The objective is to impart the fundamental functionality to get an extensive understanding of how EOG signa...
Testing the Exactitude of Estimation Methods in the Presence of Outliers: An accounting for Robust Kriging
Estimation of gold reserves and resources has been of interest to mining engineers and geologists for ages. The existence of outlier values shows the economic part of the deposits subject to the fact that don’t depend on the human or technical errors. The presence of these high values causes a pseudo dramatically increment in variance estimation of economical blocks when applying conventional m...
The Use of Robust Factor Analysis of Compositional Geochemical Data for the Recognition of the Target Area in Khusf 1:100000 Sheet, South Khorasan, Iran
The closed nature of geochemical data has been proven in many studies. Compositional data have special properties that mean that standard statistical methods cannot be used to analyse them. These data imply a particular geometry called Aitchison geometry in the simplex space. For analysis, the dataset must first be opened by the various transformations provided. One of the most popular of the a...
Object-Oriented Method for Automatic Extraction of Road from High Resolution Satellite Images
As the information carried in a high spatial resolution image is not represented by single pixels but by meaningful image objects, which include the association of multiple pixels and their mutual relations, the object based method has become one of the most commonly used strategies for the processing of high resolution imagery. This processing comprises two fundamental and critical steps towar...
Iron leaching from bauxite ore in hydrochloric acid using response surface methodology
In this work, hydrochloric acid is used to remove iron impurities in the bauxite ore contained in the diasporite mineral located in the Sari region. The bauxite ore was calcined at different temperatures and times, and then dissolved in a hydrochloric acid solution. After determining the optimum calcination conditions in 1 h at 900 °C, the response surface methodology (RSM) with four factors in...